Automatic Detection and Removal of Disfluencies from Spontaneous Speech

Authors

  • Mayank Kaushik
  • Matthew Trinkle
  • Ahmad Hashemi-Sakhtsari
Abstract

Unlike rehearsed and prepared speech, spontaneous speech contains a high rate of disfluencies such as repetitions, filled pauses, and hesitations. Disfluencies can seriously degrade the word recognition accuracy of an Automatic Speech Recogniser (ASR) by increasing word insertion, deletion, and rejection rates. In this paper we introduce signal processing algorithms that automatically identify and remove repetitions and filled pauses from spontaneous speech before it is passed to an ASR for transcription. The algorithms are tested with the Dragon NaturallySpeaking speech recogniser and show significant improvements in word recognition accuracy, with corresponding reductions in substitution, deletion, and insertion errors.

Similar resources

Automatic Detection of Sentence Boundaries, Disfluencies, and Conversational Fillers in Spontaneous Speech

Automatic detection and annotation of disfluencies in spoken French corpora

In this paper we propose a multi-step system for the semiautomatic detection and annotation of disfluencies in spoken corpora. A set of rules, statistical models and machine learning techniques are applied to the input, which is a transcription aligned to the speech signal. The system uses the results of an automatic estimation of prosodic, part-of-speech and shallow syntactic features. We pres...

Detection of filled pauses in spontaneous conversational speech

Most automatic speech recognition work has concentrated on read speech, whose acoustic aspects differ significantly from speech found in actual dialogues. A primary difference between read speech and spontaneous speech concerns a high rate of disfluencies (e.g., filled pauses, repetitions, repairs, false starts). Filled pauses (e.g., “uh,” “um”), unlike silences, resemble phones as part of word...

Disfluent Lengthening in Spontaneous Speech

We investigate lengthening in spontaneous speech with the aim of using it as a time-management strategy in incremental spoken dialogue systems. Lengthening is a common feature of speech, occurring regularly near the edges of intonation phrases. It behaves similarly to disfluencies when it occurs in places remote from phrasal boundaries. Disfluencies have proven useful in incremental spoken ...

Handling Disfluencies in Spontaneous Language Models

In automatic speech recognition, a stochastic language model (LM) predicts the probability of the next word on the basis of previously recognized words. For the recognition of dictated speech this method works reasonably well since sentences are typically well-formed and reliable estimation of the probabilities is possible on the basis of large amounts of written text material. However, for spo...

Publication date: 2010